Polar Lattices: Where Ankan Meets Forney 



Yanfei Yan and Cong Ling 
Department of Electrical and Electronic Engineering 
Imperial College London 
London, UK 
Email: {y.yanlO, cling} ©imperial. ac.uk 



Xiaofu Wu 

Nanjing University of Posts and Telecommunications 
Nanjing, China 
Email: xfuwu@ieee.org 



CO 

o 

CN 



C/5 

O 



CN 
> 
O 

On 

in 

CN 

o 

co 



Abstract — In this paper, we propose the explicit construction 
of a new class of lattices based on polar codes, which are 
provably good for the additive white Gaussian noise (AWGN) 
channel. We follow the multilevel construction of Forney et al. 
(i.e., Construction D), where the code on each level is a capacity- 
achieving polar code for that level. The proposed polar lattices are 
efficiently decodable by using multistage decoding. Performance 
bounds are derived to measure the gap to the generalized capacity 
at given error probability. A design example is presented to 
demonstrate the performance of polar lattices. 

I. Introduction 

Polar codes, proposed by Ankan in JT], can provably 
achieve the capacity of binary memoryless symmetric (BMS) 
channels. The authors in |2| and |3| proposed efficient algo- 
rithms to construct polar codes for BMS channels with almost 
linear complexity. Until now, it is still an open problem to 
construct practical polar codes to achieve the capacity of the 
additive white Gaussian noise (AWGN) channel, although an 
attempt from the theoretic side has been made in |0J. 

Lattice codes are the counterpart of linear codes in the 
Euclidean space that are known to achieve the AWGN channel 
capacity, based on the random coding argument J5J. At the 
core of lattice coding is an AWGN-good lattice, whose error 
probability, for infinite lattice coding, vanishes as long as the 
volume-to-noise ratio (VNR) is larger than 1. Such a lattice 
is also referred to as achieving the generalized capacity |]6) 
or the sphere bound Q. Several practical lattice constructions 
have been introduced recently, mostly inspired by low-density 
parity check (LDPC) codes and turbo codes ||8l. lEfl. iflOl, ifTTI 
However, there is no proof that such lattices are AWGN-good. 
This paper will focus on the the design of the AWGN-good 
lattice, while the issue of a finite power constraint can be dealt 
with by the standard shaping technique (see, e.g., 0). 

In this paper, we propose polar lattices to solve both 
problems aforementioned. Polar lattices are explicit AWGN- 
good lattices constructed from polar codes. More precisely, 
we use Forney et a/.'s multilevel construction [7], where for 
each level we build a polar code to achieve its capacity. The 
analysis of Q shows that if the parameters of the component 
lattices are carefully chosen and if the component codes on all 
levels each are capacity-achieving, then the resulted lattice will 
achieve the sphere bound. Following their analysis, we derive 
upper bounds on the gap to the sphere bound and on the error 
probability. The derived performance bounds are computable 
and hold for finite lengths. 
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Fig. 1. Block error probabilities of polar lattices and Barnes-Wall (BW) 
lattices of length N = 1024 with multistage decoding. 

This paper is built on the basis of our prior attempt ifTZl . 
where we followed the structure of Barnes-Wall lattices, but 
changed the Reed-Muller code to a polar code on each level. 
By applying the capacity rule Q rather than the Barnes-Wall 
rule, the performance of our polar lattices has been greatly 
improved, as shown in Fig. Q] The encoding and decoding 
complexity of polar lattices is almost the same as that of 
Barnes-Wall lattices. We use the same multistage decoder for 
both polar lattices and Barnes-Wall lattices. The reason for the 
relatively poor performance of Barnes-Wall lattices is that they 
violate the capacity rule: the rates of some component codes 
exceed the channel capacities of the corresponding levels. 
More discussion will be given later. 

At the final stage of this paper, we became aware of a related 
work 1 13 1 where polar codes were used in multilevel coded 
modulation. However, lTT~3l is not about the construction of 
lattices, and no bounds on the error performance were derived. 

Our paper is organized as follows. Section II presents the 
background of lattice theory and multilevel construction. The 
construction of polar codes for the mod-2 BAWGN channel, 
which serves as a building block for polar lattices, is given in 
Section III. In Section IV, we derive the performance bounds 
and prove that polar lattices can achieve the sphere bound. 
The practical design and simulation results are presented in 
section V. We conclude the paper in Section VI. 



II. Background on Lattices and Multilevel 
Construction 

A lattice is a discrete subgroup of K" which can be 
described in terms of a generator matrix: 

A = {A = Gx : x e Z"}, 

where G is an n-by-n full-rank real matrix. 

Let V(A) be the fundamental volume of A, and a 2 be the 
variance of the AWGN. The VNR of an n-dimensional lattice 
A is defined as 



a 2 (A,a 2 ) 



VW1 
27re<7 2 



(1) 



A sublattice A' induces a partition (denoted by A/ A') 
of A into equivalence groups modulo A'. The order of the 
partition is denoted by |A/A'|, which is equal to the number 
of the cosets. If |A/A'| = 2, we call this a binary partition. 
Let Ai/ ••• /A r _i/A r be an n-dimensional lattice partition 
chain. In the multilevel construction of lattices, for each 
partition Ag/ Ae + i (1 < I < r — 1 a code Ci) over Ae/Ag+i 
selects a sequence of coset representatives ai in a set Af of 
representatives for the cosets of A^+i. The symbols from each 
partition are summed to form a lattice point. 

In this paper, we use the one-dimensional lattice partition 
chain Z/2Z/ • ■ • /2 r_1 Z. In this case, the multilevel con- 
struction which requires a set of nested binary linear codes 
is known as "Construction D". Let the set of nested binary 
linear codes Ci be represented as [N, ke] for 1 < i < r — 1 
and C\ C C2 ■ ■■ C C r -%, The lattice L constructed by this set 
of binary codes is described by the following code formula 



L = Ci + 2C 2 + 



,r-2 



C r - 



2 r ~ 1 Z 



TV 



The fundamental volume of a lattice obtained in this way is 
given by 



V(L) 



rN- 



(2) 



Barnes-Wall lattices constructed from Reed-Muller codes is an 
example of Construction D using the above one-dimensional 
lattice partition. For example, the 1024-dimensional Barens- 
Wall lattice is constructed as: 

BW W 24 = RM(1, 10) + 2RM(3, 10) + ■ • ■ + 2 5 Z 1024 . (3) 

III. Polar Codes for Mod-2 BAWGN Channel 

In this section, we will show how to construct a polar code 
for each level in the multilevel construction of polar lattices. 
The Z/2Z channel is in fact a mod-2 BAWGN channel. The 
signal flow of the mod-2 BAWGN channel is shown in Fig. 
[2] The mod-2 operation is applied within [—1, 1] not [0, 2]. 

A. Channel Capacity of Mod-2 BAWGN Channel 

In 0, the capacity of the mod- A channel and A/ A' channel 
was derived. Therefore we can calculate the capacity of the 
mod-2 BAWGN channel. 
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Fig. 2. Signal flow of the mod-2 BAWGN channel. P{N, k) represents the 
polar code with block length TV and k information bits. 

Let f(x) be the probability density function (PDF) of the 
Gaussian noise x € M" 

n ; n II *ll 2 

Then after the mod-2Z (mod-2) operation we have the density 

fn,a*&) = 2 + x' g [-1,1]. (4) 

The capacity of the mod-A channel is 

C(A,a 2 )=log V(A)-h(A,a 2 ) 1 



(5) 



where h(A, a 2 ) is the differential entropy of the A-aliased 
noise 

MA, a 2 ) = - / f A ,A x ') log hMx'W 

JV(A) 

where V(A) denotes the Voronoi region. 

Furthermore, the capacity of the Z/2Z channel is J7) 

C(Z/2Z) = C(2Z, a 2 ) - C*(Z, a 2 ). 

The channel capacity of the Z/2Z channel is shown in Fig. 
|3] Also shown are 2Z/4Z and 4Z/8Z channels which are also 
mod-2 BAWGN channels but upgraded versions (with smaller 
noise variance) of Z/2Z. For example, coding over 2Z/4Z is 
nothing special compared with coding over Z/2Z. The only 
difference is the signal power is increased by 4. Therefore it is 
equivalent to a Z/2Z channel with the noise variance divided 
by 4. This structure simplifies our task to find good polar codes 
for each channel. We just focus on the Z/2Z channel with 
different noise variances. The above finding also helps us to 
prove the nested structure of polar codes which are constructed 
for these mod-2 BAWGN channels. 

B. Symmetry of Mod-2 BAWGN Channel 

Polar codes are proved to be able to achieve the capacity 
of output-symmetric channels. Therefore we need to show 
this mod-2 BAWGN channel is indeed an output-symmetric 
channel. The output of the mapping operation in Fig. [2] is 
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where the conditional PDF of t can be derived from ©. 
Then the conditional PDFs of y are 

f(y\x = l) = 



l 



f(y\x = o) 
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Fig. 3. Channel capacity of the mod-2 BAWGN channel. The capacity of 
the discrete BSM channel is calculated in Sect. IIII-Cl to show the negligible 
difference between the continuous mod-2 BAWGN channel and the quantized 
discrete channel. 

Therefore the PDFs of the output satisfy 

f(y\o) = /(-y|i). for ail y e [-1,1]. 

Clearly, it is a binary-input output-symmetric channel. 

C. Construction of Polar Codes for Mod-2 BAWGN channel 

Let VF(y|a;) be a BMS channel. Polar codes are block 
codes of length TV = 2 m bits {ui}^L 1 with binary alphabet 
X. Let I(W) be the symmetric capacity of W. Given the 
rate R < I(W), a set of RN rows of the generator matrix 
G = [ \ 0]® m are to be chosen. This gives an TV-dimensional 
channel Wn(Ui\u^). The channels seen by each bit can be 
defined by JT] 

1 



W$(y»,ui- l \ Ui )= J2 



. 2"-iW N (y»\u») 



(i) 

Ankan proved that as TV grows Wpf approaches either an 
error-free channel or a completely noisy channel. The set of 
completely noisy (resp. error-free) bit channels is called frozen 
set A c (resp. free set A). The corresponding bits in {uj}fL 1 
are called frozen bits (resp. free bits). Set Ui = for i £ A 
and only send information bits within A. 

The block error probability Pg can be upper bounded 
by the sum of the bit channels' Bhattacharyya parameters 
as -Pb(Wjv) < Y>i(zAZ(W^). Constructing polar codes is 
equivalent to choosing all the good indices. However, the 
complexity of the exact calculation appears to be exponential 
in the block length. The authors in [0, (2) proposed a practical 
quantization method to control the complexity. We use their 
method to construct polar codes for the mod-2 BAWGN 
channel. The details are given in |[T2l . 

We calculate the capacity of the quantized discrete channel 
which is generated by the Algorithm 1 in |[T2l . We set K = 32 
so that 2K is the number of quantization intervals. Note that 



1 — I(W) is the expectation of the function —x log (x) — (1 — 
x) log (1 — x) over the resulted distribution P x . The results 
are shown in Fig. [3] It is seen that the capacity of this discrete 
BMS channel is almost the same as the theoretic capacity of 
the mod-2 BAWGN channel. 

The approximation loss in the capacity due to the above 
quantization can be bounded in a precise manner. It was shown 
that the merging algorithms of polar code construction in |f3] 
and |2j share the same bound on the approximation loss. 
Denote the upper bound on the probability of error for the 
resulting degraded bit channel W$ by P e (W^\ K). Given 
constant (3 < 1/2, let the rate loss ei oss be defined by 

1 



TV 



{i:P e (W^\K)<2- N '} 



I(W) 



Then, according to Theorem 16 in J3), liminf, 
e(W, K, j3) which is not a function of TV, and 

lim e(W,K,P) = 0. 



(6) 



It means we can get arbitrarily close to the optimal con- 
struction of a polar code if K is sufficiently large. 

IV. Sphere-Bound-Achieving Polar Lattices 

In this section, we will prove the polar lattices can achieve 
the sphere bound. At first we will prove that the polar codes 
used in all levels are nested. Then performance bounds are 
derived, and polar lattices are proved to achieve the sphere 
bound asymptotically. 

A. Nested Polar Codes 

Assume P{N,kg) (1 < £ < r — 1) is the polar code 
constructed at the £-\h level. We select the channel indices 
according to the Bhattacharyya parameters. The following 
lemma shows that the polar codes in the multilevel construc- 
tion are nested. This is a requirement of Construction D. 

Lemma 1: Polar codes constructed in the multilevel con- 
struction are nested; i.e., P(N,k\) C P(N,k 2 ) C • • • C 

P(N,kr-l). 

Proof: By Lemma 4.7 in Q3J, if a BMS channel W is a 
degraded version of W, then is degraded with respect to 



W$ and Z(W^') > Z{W^'). Let the target error probability 
be P e = 2~ N . The codewords are generated by x N = u^Ga, 
where Ga is the submatrix of G formed by rows with indices 
in the free set A. The free sets for these two channels are 

={* : Z{W$>) < P e }, 



r(<) 



A 



A ={i : Z(W$) < P e }. 



Due to the fact that Z{W^ ] ) > Z(W^'), A C A. If we 
construct polar codes P(N, A) over W and P(N, A) over W, 
G A is a submatrix of G A . Therefore P{N,A) C P(N,A). 

For the one-dimensional lattice partition, the 2^~ 1 Z/2^Z 
channel with noise variance a 2 is equivalent to the 2 lr Lj2 l+lr L 
channel with noise variance 4cr 2 for 1 < I < r. Therefore 
the channel of the ^-th level is degraded with respect to the 
channel of the (I + l)-th level, and consequently, P(N, kg) C 

P(N,k e+1 ). m 



B. Performance Bounds 

As we use a multistage decoding algorithm, the overall 
(block) error probability P e {L, a 2 ) of a polar lattice is upper- 
bounded by the sum of the block error probabilities at indi- 
vidual levels, by the union bound. The block error probability 
of the polar code for each level is denoted by Ps(Ce, a 2 ) for 
(1 < £ < r - 1) and by P B (Af , a 2 ) for the r-th level. Then 

r-l 

Pe(L 7 a 2 ) < ^P fl (C,,<7 2 ) + P B (A* a 2 ), (7) 



From 1151 we can derive the bound e\ < eA 1 (cr)log 2 e 
where (a) is the flatness factor. We may choose the lattice 
Ai sufficiently fine so that ca^ct) is negligible. 

By the union bound, we may choose V(A r ) large enough 
that P B (A^a 2 ) 0. 

From (O, e 3 is the sum of the rate losses 



where the error probability of A;, is given by 



P s (A?,a 2 ) = l 



f a 2(x)dx. 



(8) 



As shown in the previous subsection, we can construct 
nested polar codes with rates k(C() = % = C(Ae/ A(+±, a 2 ) — 
eioss(^) such that the block error probability in each level 
Pb(C^,ct 2 ) is upper bounded by 2~ Nf>t , for any fig < h. 

The volume of the lattices constructed by Construction D 



V(L) = 2 & V(A r ) N . 
Therefore the logarithmic VNR of L is 



(9) 



log 2 a 2 {L,a 2 ) 



log 2 



V{L)tt 
2nea 2 
-2 r E K, 



log 2 



V{A r f 



2ixea 2 



Define 



-2 V" Kt + 2 log 2 V(A r ) - log 2 2irea 2 



f ei = C(A 1 ,<j 2 ), 
e 2 = h(a 2 )-h(A r ,a 2 ), 

r-l 

e 3 = C"(Ai/A r ,cr 2 ) -J2 K £> 



where h(a 2 ) = ^ log 2 27recr 2 which is the differential entropy 
of the Gaussian noise. 
Then we have 

log,^ 2 ^^ 2 ) = 2(log 2 F(A r )-C(A 1 /A r ,a 2 ) 

+ £ 3 ~ 2 l0g2 27re<j2 ) 

= 2(log 2 V(A r ) - log 2 V(A r ) + h{A r , a 2 ) 

+ £l +£3- 2 l0 g2 27recr2 ) 

= 2(ei-e 2 + e 3 ). 

It was shown in Q that e 2 ~ nP e {A r ,a 2 ), which is 
negligible compared to the other two terms. We can drop the 
term e 2 such as to obtain an upper bound 

log 2 a 2 (i,a 2 )<2( ei +e 3 ). (11) 



£3 = y^Qossffl- 

i=l 

We have 

r-l 

lim lim e 3 = lim V" e(Ag/At + i, K, f3 e ) = 0, 

K— ¥00 N— >oo fr_v^, ^— — ' 



£=1 



and the sum error probability of polar codes is upper bounded 
by 



r-l 



r-l 



which can be made arbitrarily small by increasing the block 
length of polar codes N . 

In conclusion, we have the following theorem: 

Theorem 1: The error probability of polar lattice L is 
bounded by 



(10) 



r-l 



P e (L,a 2 ) <J2 N2 



1=1 



+ N\1- I f a 2(x)dx , (12) 

V(A r ) / 



with the logarithmic VNR bounded by (fTTT i. In the limit 
as r, K, N — > 00, L can achieve the sphere bound, i.e., 

a 2 (L,a 2 ) -> 1 and P e (L,a 2 ) -> 0. 

Remark 1: Note that the integral in dT~2T > can simply be 
represented by the Q-function, since A r is a scaled version 
of Z. 

V. Practical Design and Simulation Results 

We give a design example of polar lattices in this sec- 
tion. The design follows the capacity rule. We use the one- 
dimensional lattice partition Z/2Z/ • • • /2 r ~ 1 Z. We suggest 
using a not VNR as the reference parameter to choose 
component codes. This is because a can be calculated in each 
level and can fully represent the channel. 

One can determine the number of valid levels according to 
the capacity calculation as shown in Fig. [3] If the actual rate 
of a level is almost 1 and still can achieve the target block 
error probability Pb for the reference a, it is unnecessary. 
This is because the rate from this level will be canceled by 
5 log 2 27re<7 2 . The effective levels are those with an actual 
rate visibly within (0, 1). In other words, adding levels whose 
capacities are close to 1 or do not noticeably change the 
VNR. 

Therefore, we choose two levels, which was also suggested 
in 0. The code formula of the polar lattice is 



P(N, h) + 2P{N, fc 2 ) +4Z 



JV 




Fig. 4. A polar lattice with two levels. 

where P(N, k) represents the polar code. The multilevel 
construction and the multistage decoding are shown in Fig. 
ID <7j means the standard deviation of the mod-2 BAWGN 
channel for the z-th level. Note the difference of a between 
each level is only a factor of 2. 

We start from the bottom level to the top level. The bottom 
level is a 7L N lattice decoder. First, from ((8), if we set the 
target error probability to 10 -5 , 03 ps 0.08719. Then, move 
to the middle level. cr 2 = 2 * er 3 = 0.1744. From Fig. |3] the 
channel capacity of the middle level is C(Z/2Z, 02) = 0.9830. 
Finally, the capacity of the top level is 0.4775 as o\ = 0.3488. 
Our goal is to find two polar codes with rates close to their 
respective capacities and block error probabilities < 10~ 5 over 
the mod-2 BAWGN channel. 

From (fTTT i. 

log 2 a 2 (L, a 2 ) = 2 (\og 2 V(A r ) -R-\ log 2 2neo{\ , (13) 

where R is the sum rate of the component polar codes. 

From our simulations, for N = 1024, we found that 
P B (^ = 0.22, a\) ps 2xl0~ 5 , P B (^ = 0.9,of) ps 2x10~ 5 
and P s (Af ,ct|) ps 1 x 10~ 5 . Therefore R = 0.22 + 0.9. 
According to (0 and (foi l. 

P a (£,<r?)w5x 10" 5 , (14) 

and 

10 log 10 a 2 (i, a\) = 2.12 dB, (15) 

which is not surprisingly very close to our simulation shown in 
Fig. Q] We use belief-propagation (BP) decoder to decode the 
component polar code for each level. This simulation implies 
that the performance of the component codes is very important 
to the multilevel lattice codes. The gap to the sphere bound is 
largely due to the rate loss of polar codes. 

Now we go back to the Barnes-Wall lattices. According 
to Fig. |3] there are only 2 valid levels, but the Barnes-Wall 
lattice © has 6 levels for N = 1024. The rate of the first 
level is 0.01, which is much bigger than the capacity of the 



first level when VNR is low. Another reason is the relatively 
weak error-correction ability of Reed-Muller codes. Therefore 
the error probability of the first level will be very high in the 
low VNR region as shown in Fig. Q] 

VI. Conclusions 

In this paper, we have followed Forney et al.'s multilevel 
approach to construct polar lattices. The channel capacity of 
each level is calculated and a polar code is constructed to 
achieve its capacity accordingly. Since polar lattices are as 
explicit as polar codes, their construction is equally efficient. 
Compared with existing schemes J8), ©, iflOl . ifTTl . polar 
lattices are distinguished by their provable AWGN-goodness 
and low complexity, namely, they asymptotically achieve the 
sphere bound with multistage decoding. 
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